Few-shot symbol classification via self-supervised learning and nearest neighbor
نویسندگان
چکیده
The recognition of symbols within document images is one the most relevant steps involved in Document Analysis field. While current state-of-the-art methods based on Deep Learning are capable adequately performing this task, they generally require a vast amount data that has to be manually labeled. In paper, we propose self-supervised learning-based method addresses task by training neural-based feature extractor with set unlabeled documents and performs considering just few reference samples. Experiments different corpora comprising music, text, symbol report proposal tackling high accuracy rates up 95% few-shot settings. Moreover, results show presented strategy outperforms base supervised learning approaches trained same that, some cases, even fail converge. This approach, hence, stands as lightweight alternative deal classification annotated data.
منابع مشابه
Meta-Learning for Semi-Supervised Few-Shot Classification
In few-shot classification, we are interested in learning algorithms that train a classifier from only a handful of labeled examples. Recent progress in few-shot classification has featured meta-learning, in which a parameterized model for a learning algorithm is defined and trained on episodes representing different classification problems, each with a small labeled training set and its corres...
متن کاملFew-shot Classification by Learning Disentangled Representations
Machine learning has improved state-of-the art performance in numerous domains, by using large amounts of data. In reality, labelled data is often not available for the task of interest. A fundamental problem of artificial intelligence is finding a representation that can generalize to never seen before classes. In this research, the power of generative models is combined with disentangled repr...
متن کاملGS4: Generating Synthetic Samples for Semi-Supervised Nearest Neighbor Classification
In this paper, we propose a method to improve nearest neighbor classification accuracy under a semi-supervised setting. We call our approach GS4 (i.e., Generating Synthetic Samples Semi-Supervised). Existing self-training approaches classify unlabeled samples by exploiting local information. These samples are then incorporated into the training set of labeled data. However, errors are propagate...
متن کاملWeighted Nearest Neighbor Classification via Maximizing Classification Consistency
The nearest neighbor classification is a simple and effective technique for pattern recognition. The performance of this technique is known to be sensitive to the distance function used in classifying a test instance. In this paper, we propose a technique to learn sample weights via maximizing classification consistency. Experimental analysis shows that the distance trained in this way enlarges...
متن کاملSemi-Supervised Few-Shot Learning with Prototypical Networks
We consider the problem of semi-supervised few-shot classification (when the few labeled samples are accompanied with unlabeled data) and show how to adapt the Prototypical Networks [10] to this problem. We first show that using larger and better regularized prototypical networks can improve the classification accuracy. We then show further improvements by making use of unlabeled data.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition Letters
سال: 2023
ISSN: ['1872-7344', '0167-8655']
DOI: https://doi.org/10.1016/j.patrec.2023.01.014